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Abstract 

Stochastic models, based on random processes, may lead to power law distributions, which 
provide long range correlations. The observation of power law behavior and the presence of long 
range correlations in biological systems has been demonstrated in various studies. The combination 
of the two just mentioned results, theoretical and experimental, supports strongly the scenario of 
biological evolution across different organisms. In the current Letter we explore in a general way, 
using the algebra of Nonextensive Statistics introduced by Tsallis and coworkers, if the processes 
which are described by a class of stochastic models are really random and discuss the results with 
regard to a possible biological evolution. 
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In the last years a variety of systems in diverse scientific fields has been explored present- 
ingpower law distributions like in biophysics jlj, neurophysics economics 3], turbulence 



5], urban agglomerations 6| and many ot 
many stochastic models have been studied 



lers. Because of the ubiquity of power laws 
9j], which are based on random processes and 



lead to the desired power law distributions. These results have a great impact on our point of 
view about the rules that dominate in nature. Singularly, in biology the success of deriving 
power laws through the consideration of random processes supports strongly the scenario of 
the biological evolution across different species. These models do not give explicit descrip- 
tion of the transition between different organisms during the evolution but they can form 
well, under consideration of biological processes like mutation, duplication, etc., symbolic 
sequences which correspond to polynucleotide DNA chains. For example in Ref. [7| Provata 
used a random aggregation model to describe a possible formation of such a polynucleotide 
DNA chain. The model is based on the probability formula 



introduced by Takayasu and collaborators lCj, LL3| , which describes the probability of finding 



a macromolecule of size s at the time t+At. The analytical expression of Eq. (DO) is computed 
in the transformed Fourier space. A different stochastic approach is considered in Ref. [s| 
by Messer, Arndt and Lassig, who constructed a master equation for the joint probabilities 
P eq (r,t) and P op (r,t) of finding a GC pair and a AT pair, respectively, at a distance r 

j f Pe q (r, t) = 2/i cff [P op (r, t) - P eq (r, t)] 

+ [r«y+(r-l) 7 + ] [Pap(r-l,t)-Peq(r,t)] ■ ( 2 ) 
+ r 7 - [P cq (r + l,t)-P cq (r, t)} 

The exchange P eq (r, t) «-> P op (r, t) in Eq. (j2J) gives the analogous equation for the probability 
P op (r, t). Then, the correlation function C(r, t) = P cq (r, t) — P op (r, t) can be computed. The 
consideration of both Eqs. (p]) and ^ leads, under certain assumptions, to power law 
distributions. 

On the other hand, it is well known that the existence of power law distributions indicates 
the presence of long (or short) range correlations. Yet, this is in contradiction with the 
statistical assumptions of the stochastic models about random processes. It would mean 
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that uncorrelated processes can create, in an unexplained way, correlations between them. 
Even if we assume that in nature such an event (creation of correlations through random 
processes) can be possible, can we assume the same in mathematics? A stochastic model 
is a mathematical construction with concrete rules and it can not lead to the existence of 
correlations if we have not first introduced them in the model. Starting from this point of 
view we shall explore in the next paragraphs whether a class of stochastic models, where 
sums appear like in Eqs. ([I]) and ([2]), describes really random processes. For this aim we 
shall use the g-Algebra [jjl of Nonextensive Statistics [ijj]. The results will be discussed in 
the frame of biology. 

Nonextensive Statistics is a possible generalization of the ordinary random statistics, 
depending on a parameter q G [0,1] (the g^i-branche corresponds to different g-statistics 



14|). Its structure is based on the following deformed logarithm 



with its inverse deformed exponential function 

e g (x) := [1 + (1 - q)x}^ , (x > 0). (4) 

When q = 1 the entire generalized statistical structure recovers the ordinary one. One of the 
main points in Nonextensive Statistics is its successful treatment of correlated probabilities 
(or processes in general), which are projected on the values of the parameter q. 

First, we introduce some elementary notions of the Probability Theory. A probability 
space, which represents our uncertainty regarding an experiment consists of two parts: i) a 
set of events/outcomes, which is denoted as sample space Q and ii) a real function of the 
subsets of fl which is denoted as probability measure P. Now, let us follow the next thought. 
We consider two coins / and II. The sides of each coin have a different symbol, so that the 
sample spaces fli and flu consist of two events, Qi = {A, B} and flu = {A, B}. Then, 
the intersection probability (of both occurring) in each sample space separately is of course 
equal to zero 

P(Af)B) = = P(AnB). (5) 

The union probability (of either occurring) in both spaces separately is equal to the unity 

P(AUB) = P(A) + P(B) 
P(AVJB) = P(A) + P(B) 



1. (6) 
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Eq. ([6]) corresponds in physics to the normalization constraint. In this probability measure 
the event probabilities in each sample space are strongly correlated in such a way that their 
sum is always equal to the unity. We denote these correlations as correlations of Type I. 
On the other hand, with respect to the values of the probabilities, there are infinite ways 
to fulfil Eq. In order to capture all these possibilities, or better to attribute them to a 
cause, we introduce yet another type of correlations, which we call correlations of Type II. 
These correlations are individual for each event. In our example they could represent the 
different weights of each side of the coin. When the correlations of Type II vanish, then we 
have equal event probabilities. 

In further, we join the spaces Qiji. Now, we have one more type of correlations to 
consider, which we denote as correlations of Type III. These correlations are between the 
several combinations of the events. Following these steps, the consideration of more compli- 
cate processes or compositions would lead to an increasing number of the correlation types 
(higher order correlations, Type >III). In shake of the simplicity, we assume that there are 
no correlations of the Types II and III in the new sample space Qm and because of this, 
Qm is a tensor product of both subspaces Qm = Qi x Qn = {AA, AB, BA, BB}. In this 
case the event probabilities P{A), P{B), P{A) and P(B) are equal and independent. Then, 
the intersection probabilities are given by 

P(A nl)= P(A)P(A), (7) 

P(ADB) = P(A)P(B), (8) 

P(BnA) = P(B)P(A), (9) 

P(B f] B) = P(B)P(B), (10) 

and because of the union probability 

P(AA UABUBAU BB) = P(A)P(A) + P(A)P(B)+ 

P{B)P{A) + P{B)P(B) = 1, 

their values are equal to 1/4. Here, there is a very important point to stress. Namely, 
the feature of the probability (in) dependence is meaningful in the frame of an appropriate 
probability measure. As we can see in Eqs. (JTj) - (flOl) . the intersection probabilities of 
Qm are computed based on the independence of the event probabilities between the two 
subspaces Qi and Qn. Yet, the union probability of Qm correlates strongly the intersection 



(11) 



probabilities. Furthermore, it becomes evident that when we refer to correlations we have 
to be very concrete about their Type. To give a more general character to the intersection 
probabilities for the further purposes of this Letter we shall call them Blocks of probabilities. 

In Ref. jl^] the present author demonstrated the importance of the distinction between 
the diverse types of correlations. The existence of two different types of correlations within 
Nonextensive Statistics has been introduced, the inner and the outer, described by a set of 
parameters 1Z = {Ri}i=i,... >n and Q = {Qi}i=i,..., n ' respectively, in order to distinguish be- 
tween the three generalized entropic structures that can be created, based on any generalized 
logarithmic function. The inner and outer correlations in this case should not be necessarily 
identified with ones in our example above. Their notion must be every time adjusted to 
the features of a problem under consideration. For the one parametric (Q = {Qi} = q) 
generalized logarithm ([3]) with its inverse function (j3J) these three structures correspond to 
the Tsallis, Renyi and Nonextensive Gaussian entropy. We shall keep the notation of the 
correlation-sets 1Z and Q throughout the paper. 

For an arbitrary deformed logarithmic and exponential function we define the generalized 
product between two elements x, y 6 M+ as ]j| 

x <g> Q y := e Q (\n a (x) + ln Q (y)) . (12) 

For Q — > <2 we obtain the ordinary multiplication since eg (ln.Q (x) + lng (y)) = 
e (ln(x) + ln(y)) = x x y. Then, for the Q- multiplication of m elements we have 



m I m 



JJ^ x k = e Q [ ^ ln Q(^) ] , (a* > 0). (13) 



k=l 



,fc=l 



Using the deformed functions in Eqs. ([3]) and (jlj), Eq. fflBl takes the explicit form 



m 

nx k 

k=l 
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J2 x l 9 -(m-r 

_k=l 

If we express the normalization constraint of W intersection probabilities Pj through the 
g-algebra we can easily verify the relation 

w w 



y q 

3=1 j=\ 



= I- (15) 

q=0 



The value q = in g-Statistics corresponds to long range correlations. This result reproduce 
correctly the correlations of Type I behind the normalization constraint, which relate all 



probabilities in a very specific way. Since generalized statistics gives us proper results and 
does not lead to inconsistencies we shall use it in the next paragraph in order to complete 
the aim of this Letter. 

We consider the statistical processes of a system and define the probabilities under which 
these processes take place and the way they interact (correlated or uncorrelated) . An exam- 
ple for such a system under consideration could be a DNA chain and a possible statistical 
process could be the mutation of a nucleotide. In connection to our example with the coins, 
all these interacting probabilities define the respective Blocks of probabilities for the current 
approach and are denoted as {A k (Pj)}k=i,...,m- The internal ^-correlations characterize 
the different types of correlations within a Block. These correlations are canceled for certain 
values TZ — > TZq. Now, we want to see how the several fc-Blocks interact with each other. 
These interactions are described by the external correlations Q. Then, we introduce the 
probability functional BsQ^\{Pj) which is equal to the entire internal and external interac- 
tions. It could be a derivative of the probabilities Pj in A k (Pj), as in Eq. (j2j), or simply a 
new probability, as in Eq. (JTJ, etc. The elimination of the external correlations is defined 
again in a certain limit Q — > <2o Then, the entire process is given by an expression of the 
form 

m 

B { qM P j) = ™ ®Q IL (16) 

k=l 

with B{Q t Ti}(Pj) > 0. We stress that the Eq. ffT6l) does not describe exact processes. It 
is a general expression of a set of probability interactions. The Q-multiplication with the 
constant m, does not have a specific meaning but is useful for our purpose. We demand that 
the stochastic processes inside of each Block are independent (1Z — > 7Zo), so that they are 
multiplicatively connected through the ordinary multiplication. Using Eq. (1141) . the relation 
in Eq. (|16|) can be written as 



B { q ,n }{Pj 



m 
k=l 



1 1-9 
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(17) 



with m(q) := m 1 9 (1 — m q ). In further, we set q = as we have done in Eq. ffTol) . Then, 
from Eq. (fTTj) we obtain 

m 

B {qMo} {p 3 ) = Y,[ A ^ ] ^}- ( 18 ) 

k=l 
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Eq. (fT8l) is our main result. Namely, when the probability measure B{ q ji }(Pj) is equal 
to the sum (q = 0) of the probability Blocks, even if the internal processes inside a Block 
are independent, then this measure describes long range correlations. Accordingly, every 
stochastic model, which is based on such a sum, gives a possible creation of sequences 
through a very concrete dynamic, where the processes of the several probability Blocks are 
long range correlated (synchronized). Comparing Eqs. ([I]) and ([2]) with Eq. (fl8l) . we see 
that the two earlier equations are specific expressions of the latter one. 

If we accept that biological evolution, as described by Eq. (Tl8l) . has taken place, then the 
transition between different species during the evolution is attributed to the combination of 
two different kinds of modifications in the ^-correlations in each Block. The first kind of 
modifications do not affect the structure of the given dynamic. This case would correspond 
to the creation of different organisms of the same class. The second kind of modifications 
does affect the structure of the given dynamic. This case would correspond to the creation 
of different classes of organisms. However, the existence of long range correlations has the 
effect that a modification of a single probability causes simultaneously changes in all involved 
terms. This has as consequence that every new created organism does not have the time to 
be adapted to its environment, so that it is able to survive. On the other hand, the adaption 
to the environment is the basis of the natural selection and the biological evolution. As we 
can see, the introduction of long range correlations gives a problematic to the scenario of 
biological evolution, as has been assumed up to now. 

Summarizing, we have shown that the dependence or independence between probabilities 
is associated to an appropriate probability measure. The sum of probabilities or of a Block 
of probabilities corresponds to long range interactions between the participating terms. Ac- 
cordingly, any stochastic model in which such a sum appears, correlates in a very specific 
way, with long range features, the respective processes. The result is a non-exponential prob- 
ability distribution function. Beside the clarification of the appropriate probability measure, 
we need to clarify the Type (or Types) of the correlations as well. There is a variety of dif- 
ferent types of correlations in a process, whose confusion may lead to incorrect results. The 
more complicated a process is, the more different types of correlations are involved in this 
process. The presence of correlations creates difficulties considering the cornerstone of the 
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natural selection in biology, which is the adjustment of an organism to its environment. 
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